Learning in Lines of Action

نویسندگان

  • Mark H. M. Winands
  • Levente Kocsis
  • Jos W. H. M. Uiterwijk
  • H. Jaap van den Herik
چکیده

This paper investigates to what extent learning methods are beneficial for the Lines of Action tournament program MIA. We focus on two components of the program: (1) the evaluation function and (2) the move ordering. Using temporal difference learning the evaluation function was improved by tuning the weights. We found substantial improvements for three weights. The move ordering was enhanced by the Neural MoveMap (NMM) heuristic, which is based on learning. The two learning techniques improved both the playing quality and the speed of the program. Test results are given. The new evaluation function improved the program with a winning ratio of 1.68. The speed up of the NMM heuristic is 17 percent.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Study on gene action and combining abilities for thermotolerant ablilities of corn (Zea mays L.)

High temperature reduces the pollen viability and silk receptivity of corn resulting in poor seed set and reduced yield. Continuously increasing temperature and less frequency and distribution of rainfall coupled with usual canal–closure particularly in Pakistan have significantly been decreasing the grain yield. This problem could be overcome by developing heat tolerant maize hybrids. For this...

متن کامل

The Impact of Studio-based learning on Metacognition and Design Ability of Architecture Students - Action Research

Proper training can put design learners in the right direction. It also enhances the power of drawing. Objective of this study was the effectiveness of architectural studio-based learning on increasing drawing power and metacognition abilities of students. This research seeks to answer these questions: Can architectural studio-based learning increase student design ability? Can architectural st...

متن کامل

Estimation of Combining Ability and Gene Action for Agro-Morphological Characters of Rapeseed (Brassica Napus L.) Using Line×Tester Mating Design

Combining ability effects were estimated for different agronomic characters in line × tester crossing program comprising 21 hybrids produced by crossing 7 lines and 3 testers. Parents and hybrids differed significantly for general combining ability (GCA) and specific combining ability (SCA) effects, respectively. The variance due to GCA and SCA showed that gene action was predominantly additive...

متن کامل

Estimation of Combining Ability and Gene Action for Agro-Morphological Characters of Rapeseed (Brassica Napus L.) Using Line×Tester Mating Design

Combining ability effects were estimated for different agronomic characters in line × tester crossing program comprising 21 hybrids produced by crossing 7 lines and 3 testers. Parents and hybrids differed significantly for general combining ability (GCA) and specific combining ability (SCA) effects, respectively. The variance due to GCA and SCA showed that gene action was predominantly additive...

متن کامل

On-Line Learning of a Persian Spoken Dialogue System Using Real Training Data

The first spoken dialogue system developed for the Persian language is introduced. This is a ticket reservation system with Persian ASR and NLU modules. The focus of the paper is on learning the dialogue management module. In this work, real on-line training data are used during the learning process. For on-line learning, the effect of the variations of discount factor (g) on the learning speed...

متن کامل

On-Line Learning of a Persian Spoken Dialogue System Using Real Training Data

The first spoken dialogue system developed for the Persian language is introduced. This is a ticket reservation system with Persian ASR and NLU modules. The focus of the paper is on learning the dialogue management module. In this work, real on-line training data are used during the learning process. For on-line learning, the effect of the variations of discount factor (g) on the learning speed...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2002